Feature Selection for Medical Dataset Using Rough Set Theory

نویسندگان

  • Yan WANG
  • Lizhuang MA
چکیده

Rough set approach has been recognized to be one of the powerful tools in medical feature selection. Many feature selection methods based on rough set have been proposed, where numerous experimental results have demonstrated that these methods based on discernibility matrix are efficient. However, the high storage space and the time-consuming computation restrict its application. In this paper, we propose an efficient algorithm called as Feature Forest algorithm for generation of the reducts of a medical dataset. In the algorithm, the given dataset is transformed into a forest to form discernibility string that is the concatenation of some of features and the disjunctive normal form is computed to reduct features based on feature forest. In addition, experimental results on different datasets show that the algorithms of this paper can efficiently reduce storage cost and be computationally inexpensive. Key-Words: Rough set theory, disjunctive normal form, feature selection, feature forest

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diagnosis of the disease using an ant colony gene selection method based on information gain ratio using fuzzy rough sets

With the advancement of metagenome data mining science has become focused on microarrays. Microarrays are datasets with a large number of genes that are usually irrelevant to the output class; hence, the process of gene selection or feature selection is essential. So, it follows that you can remove redundant genes and increase the speed and accuracy of classification. After applying the gene se...

متن کامل

A hybrid filter-based feature selection method via hesitant fuzzy and rough sets concepts

High dimensional microarray datasets are difficult to classify since they have many features with small number ofinstances and imbalanced distribution of classes. This paper proposes a filter-based feature selection method to improvethe classification performance of microarray datasets by selecting the significant features. Combining the concepts ofrough sets, weighted rough set, fuzzy rough se...

متن کامل

Feature Selection and Classification of Intrusion Detection System Using Rough Set

With the expansion of computer network there is a challenge to compete with the intruders who can easily break into the system. So it becomes a necessity to device systems or algorithms that can not only detect intrusion but can also improve the detection rate. In this paper we propose an intrusion detection system that uses rough set theory for feature selection, which is extraction of relevan...

متن کامل

Attribute Reduction using Forward Selection and Relative Reduct Algorithm

Attribute reduction of an information system is a key problem in rough set theory and its applications. Rough set theory has been one of the most successful methods used for feature selection. Rough set is one of the most useful data mining techniques. This paper proposes relative reduct to solve the attribute reduction problem in roughest theory. It is the most promising technique in the Rough...

متن کامل

Sentiment Classification using Rough Set based Hybrid Feature Selection

Sentiment analysis means to extract opinion of users from review documents. Sentiment classification using Machine Learning (ML) methods faces the problem of high dimensionality of feature vector. Therefore, a feature selection method is required to eliminate the irrelevant and noisy features from the feature vector for efficient working of ML algorithms. Rough Set Theory based feature selectio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009